Efficient Distributed Linear Classification Algorithms via the Alternating Direction Method of Multipliers
نویسندگان
چکیده
Linear classification has demonstrated success in many areas of applications. Modern algorithms for linear classification can train reasonably good models while going through the data in only tens of rounds. However, large data often does not fit in the memory of a single machine, which makes the bottleneck in large-scale learning the disk I/O, not the CPU. Following this observation, Yu et al. (2010) made significant progress in reducing disk usage, and their algorithms now outperform LIBLINEAR. In this paper, rather than optimizing algorithms on a single machine, we propose and implement distributed algorithms that achieve parallel disk loading and access the disk only once. Our large-scale learning algorithms are based on the framework of alternating direction methods of multipliers. The framework derives a subproblem that remains to be solved efficiently for which we propose using dual coordinate descent and trust region Newton method. Our experimental evaluations on large datasets demonstrate that the proposed algorithms achieve significant speedup over the classifier proposed by Yu et al. running on a single machine. Our algorithms are faster than existing distributed solvers, such as Zinkevich et al. (2010)’s parallel stochastic gradient descent and Vowpal Wabbit.
منابع مشابه
Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers
Many problems of recent interest in statistics and machine learning can be posed in the framework of convex optimization. Due to the explosion in size and complexity of modern datasets, it is increasingly important to be able to solve problems with a very large number of features or training examples. As a result, both the decentralized collection or storage of these datasets as well as accompa...
متن کاملModified Convex Data Clustering Algorithm Based on Alternating Direction Method of Multipliers
Knowing the fact that the main weakness of the most standard methods including k-means and hierarchical data clustering is their sensitivity to initialization and trapping to local minima, this paper proposes a modification of convex data clustering in which there is no need to be peculiar about how to select initial values. Due to properly converting the task of optimization to an equivalent...
متن کاملA Proximal Point Analysis of the Preconditioned Alternating Direction Method of Multipliers
We study preconditioned algorithms of alternating direction method of multipliers type for non-smooth optimization problems. The alternating direction method of multipliers is a popular first-order method for general constrained optimization problems. However, one of its drawbacks is the need to solve implicit subproblems. In various applications, these subproblems are either easily solvable or...
متن کاملSeparable Model Predictive Control via Alternating Direction Method of Multipliers for Large-scale Systems
In this paper, an alternating direction method of multipliers (ADMM) based realtime model predictive control (MPC) algorithm is presented. With the use of indicator function and by introducing extra consensus constraints, the constrained MPC problem can be formulated as a separable MPC problem, which can be computed very efficiently by projected gradient descent ADMM update steps and Riccati re...
متن کاملManaging Photovoltaic Generation Effect On Voltage Profile Using Distributed Algorithm
In this paper, a distributed method for reactive power management in a distribution system has been presented. The proposed method focuses on the voltage rise where the distribution systems are equipped with a considerable number of photovoltaic units. This paper proposes the alternating direction method of multipliers (ADMMs) approach for solving the optimal voltage control problem in a distri...
متن کامل